A Probabilistic Inference System for The Prediction of Subcellular Localization Sites of Proteins: Application to E. coli Data
نویسنده
چکیده
We have proposed that the prediction of protein subcellular localization sites can provide a good clue for the characterization of open reading frames of unknown function [1, 2]. Our program, PSORT, has been used by a number of researchers through the Internet [3]. PSORT was originally written in the style of a 'if-then' rule-based system. Although this style has the merit of a great versatility in coding inference pathways, re-optimization of numeric parameters for either given training data or expanded rules needs an expert's manual work. Clearly, this character is not well suited for the rapidly-progressing state of genome analyses. Thus, we have been studying other mathematical models for inference that allow at least semi-automatic optimization of numeric parameters. Last year, we introduced a simple model, the waterow model, that automatically nds required threshold parameters [4]. This model showed su ciently high discrimination power for model data. Here, we describe an improved model and report its predictability when applied to more realistic data. We rst collected E. coli amino acid sequences of known subcellular localization sites from the PROSITE database (Rel. 31). Excluding hypothetical information, 336 sequences were collected in total. They were classi ed into the following 8 groups: lipoproteins at the inner membrane (imL), lipoproteins at the outer membrane (omL), inner membrane proteins with a cleavable signal sequence (imS), typical outer membrane proteins (om), periplasmic proteins (pp), inner membrane proteins with a signalanchor 2Cf0f8,B@!’Bg:eBg3X:YK&@8BN9)3X%;%s%?!<!$") 565 ?aED;T;3ED5V 1-3
منابع مشابه
Application of Artificial Neural Network and Fuzzy Inference System in Prediction of Breaking Wave Characteristics
Wave height as well as water depth at the breaking point are two basic parameters which are necessary for studying coastal processes. In this study, the application of soft computing-based methods such as artificial neural network (ANN), fuzzy inference system (FIS), adaptive neuro fuzzy inference system (ANFIS) and semi-empirical models for prediction of these parameters are investigated. Th...
متن کاملA Novel Type-2 Adaptive Neuro Fuzzy Inference System Classifier for Modelling Uncertainty in Prediction of Air Pollution Disaster (RESEARCH NOTE)
Type-2 fuzzy set theory is one of the most powerful tools for dealing with the uncertainty and imperfection in dynamic and complex environments. The applications of type-2 fuzzy sets and soft computing methods are rapidly emerging in the ecological fields such as air pollution and weather prediction. The air pollution problem is a major public health problem in many cities of the world. Predict...
متن کاملApplication of an Adaptive Neuro-fuzzy Inference System and Mathematical Rate of Penetration Models to Predicting Drilling Rate
The rate of penetration (ROP) is one of the vital parameters which directly affects the drilling time and costs. There are various parameters that influence the drilling rate; they include weight on bit, rotational speed, mud weight, bit type, formation type, and bit hydraulic. Several approaches, including mathematical models and artificial intelligence have been proposed to predict the rate o...
متن کاملPSORT-B: improving protein subcellular localization prediction for Gram-negative bacteria
Automated prediction of bacterial protein subcellular localization is an important tool for genome annotation and drug discovery. PSORT has been one of the most widely used computational methods for such bacterial protein analysis; however, it has not been updated since it was introduced in 1991. In addition, neither PSORT nor any of the other computational methods available make predictions fo...
متن کاملMolecular Characterization of the Epstein-Barr Virus BGLF2 Gene, its Expression, and Subcellular Localization
Background: Epstein–Barr virus (EBV) is a universal herpes virus which can cause a life-long and largely asymptomatic infection in the human population. However, the exact pathogenesis of the EBV infection is not well known.Objective: A comprehensive bioinformatics prediction was carried out for investigating the molecular properties of the BGLF2 and to a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997